TONE DETECTION 



FIELD OF THE INVENTION 

This Invention relates to detecting the tones used for signaling over the 
public switched telephone network (PSTN). 

BACKGROUND OF THE INVENTION 

Two types of tone signals are used for signaling in the PSTN. From a 
subscriber's premises to the central office, DTMF signaling employs two 
sinusoidal signals, one from a low frequency set of four "row" tones and one from 
a higher frequency set of four "column" tones to transmit any of the 10 digital 
values, the symbols "star' and "pound", or the additional symbols "A", "B", "C", 
and "D". For example, the digit "5" is represented by tones having frequencies of 
770 Hz and 1336 Hz. The frequencies used are not spaced at equal increments 
but rather are deliberately designed to be uncorrected, i.e., the frequencies are 
not exact multiples of a common base frequency. A different set of tones spaced 
at equal intervals of 200 Hz is used by the telephone company for MF signaling 
among central offices. All possible combinations of two frequencies are used 
resulting in a 15 distinct tone pairs and unlike DTMF, there are no "row" or 
"column" associations. 

In either situation, the accurate identification of the tones that are present 
is a process that may have to be carried out under less than ideal conditions. For 
example, undesired impulse and/or Gaussian noise, induction from power lines 
as well as components of dial tone, speech and echoes of the tone signals may 
be present in the telephone loop and should not be mistaken for DTMF signals. 
In addition, both the accuracy of the generated frequencies as well as their 
relative amplitudes may vary and the tone signals themselves can be distorted by 
loss, and non-linear distortion. Various standards have been promulgated such 
as Telcordia Generic Requirement GR506 Core - Chapter 15 and CCITT 
recommendations Q.23 and Q.24 in the "Red Book", Volume VI that detail the 
bounds of acceptable DTMF signals While GR506 Chapter 16 and ITU Q.455 
cover MF tone detection. These standards involve various criteria, such as 
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frequency distortion allowance, twist allowance, noise immunity, guard time, talk- 
down, talk-off, acceptable signal to noise ratio, and dynamic range, etc. A DTMF 
detector is required to detect a transmitted signal that has a frequency distortion 
of less than 1.5% and should not detect any DTMF signals that have frequency 
5 distortion of more than 3.5%. The term "twist" refers to the difference, in decibels, 
between the amplitude of the strongest key pad column tone and the amplitude 
of the strongest key pad row tone. For example, the AT&T standard requires the 
twist to be between -8 and +4 decibels. Different noise immunity criteria require 
that signals must meet a specified signal to noise ratio (SNR) which may range 
10 from 12 to 24 decibels. A guard time check criteria requires that tones having a 
duration greater than 40 milliseconds be detected but those having a duration 
less than 20 milliseconds not be. Speech immunity refers to the ability of the 
DTMF detector to accurately distinguish DTMF tone signals from actual speech. 
The discrete Fourier transform (DFT) running in a digital signal processor 
15 can be useful for detecting the frequency components of waveforms such as tone 
signals. For example, US patents 5,588,053 and 5,644,634 issued to Xie use the 
DFT to determine the energy level of the different frequencies present in 
waveforms being analyzed for the presence of DTMF tone signals. The energy 
levels of the frequency components are compared with predetermined thresholds 
20 to determine whether the tones that are present comply with a preset standard. 
Because the frequencies employed in DTMF signaling are not exact multiples of 
a common base frequency, using a fixed number of waveform samples in the 
DFT to detect any of the DTMF tones that may be present will cause the signal 
energy of a tone to be distributed across the spectrum possibly leading to an 
25 en-or in tone detection. The Xie patent teaches that the DFT can avoid the energy 
distribution problem by using different frame lengths of from 168 to 178 samples 
depending on the frequency being looked for in the sampled waveform. It would 
be extremely advantageous to be able to use a single frame size having a 
reasonable number of samples and still obtain detection of the tone pairs within 
30 the limits of accuracy required by any of the aforementioned standards. 
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SUMMARY OF THE INVENTION 

In accordance with the principles of the Invention, as exemplified by the 
illustrative embodiment, a fixed frame containing N samples of data is used. A 
5 DFT forms sine and cosine dot products of the data samples in the frame with 
frequencies that approximate each of the tones of the set desired to be detected 
(e.g., eight DTMF frequencies). Instead of multiplying by the exact tones of the 
set, the closest multiple of a base frequency determined by the quotient of the 
sampling frequency (typically 8000 Hz) and twice the number of samples Fs/2N 
10 is used. A nominal or "expected" phase change occurs between successive 
frames caused by the difference between a tone frequency of the set and the 
integer multiple of the base frequency used in forming the dot products. The 
power of a detected tone is determined by the sum of the squares of the in-phase 
and quadrature dot products (1^ + Q^). The arctan (Q/l) is computed for the two 
15 highest power tones on successive frames, and the phase difference between 
successive frames is computed. From the computed phase differences the 
"expected" phase change is subtracted, to calculate the frequency deviation of 
the tone. The expected phase change is mapped into the range of -PI to PI by 
taking the expected phase modulo 2*PI. If the deviations exceeds the 
20 aforementioned specifications, the detection is declared to be Invalid. 

Computation of the arctan function for phase measurement is 
advantageously performed using fixed point arithmetic available in a digital signal 
processor. For each of the two frequencies of greatest power, the arctan (0,1) 
function returns a four quadrant result between -PI and PI in order to be able to 
25 find actual phase differences. Phase is arctan(Q,l). Previous phase is the phase 
computed in the previous frame, which is used to compare with the cun-ent phase 
to determine the change over the interval of a single frame. 
DeltaPhase = Phase - Previous Phase 
PhaseDeviation = abs(DeltaPhase - ExpectedDeltaPhase); 
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These subtract operations are done modulo 2*PI and the results are forced to lie 
within -PI and PI by subtracting 2*Pi from any phase value greater than PI and 
adding 2*Pi to any phase value < -PI. 

Further in accordance with the invention, the amplitude of an observed 

5 tone is adjusted for the sin(x)/(x) roll off of the response according to the 
deviation (x) of the observed tone from the appropriate tone of the set. The nulls 
in the magnitude response occur at the center frequencies of the other bins in the 
frequency set of the DPT. The deviation "x" is used to compute a (x)/sind(x) 
multiplicative adjustment to the computed power. With the results of the 

10 aforementioned operations, decision functions are executed to determine 
whether the tones detected are valid with respect to power, signal to noise ratio, 
twist, stability throughout the requisite intervals. 

BRIEF DESCRIPTION OF THE DRAWING 
15 The foregoing and other objects and features of the invention may 

become clearer from a reading of the ensuing detailed description together with 
the drawing in which: 

Fig. 1 is a block diagram of the frequency detection process; 
Fig. 2 is a block diagram of the decision function which determines 
20 whether a detected frequency meets defined criteria; and 
Fig. 3 is the decision function state machine. 

GENERAL DESCRIPTION 

The discrete Fourier transform is a technique that involves the sampling of 
25 a waveform and summing the products of the samples with a succession of 
different complex sinusoids. The shorter the time interval between samples and 
the greater the total number of samples the more accurate the analysis. For a 
length N complex sequence x(n), n = 0, 1,2, N-1, the discrete Fourier 
transform (DFT) is defined by : 
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XK)^£x(/7)e-^'^^'"=X^(")e /f = 0.1,2,...A/-1 

where: tn = nT, the nth sampling instant (sec), where T is the time interval 
between samples of the waveform; 

cok = kth frequency sample (rad/sec), where Q is the frequency 
5 interval between frequency samples used in the summation; 

T = 1/fs , the time sampling interval (sec); and 
a = 271 fs/N, the frequency sampling interval (sec). 

The transform kernel, e"-'*"*^" = cos (%?„)-; sin {a^in)^ consists of samples of a 
complex sinusoid at N discrete frequencies ©k, uniformly spaced between 0 and 

10 the sampling rate ©s = 27tfs. Because the DFT uses evenly spaced frequency 
samples, i.e., integer multiples of a common base frequency, a problem can arise 
when the waveform being sampled contains frequencies that are not integer 
multiples of the base frequency used in the DFT. 

In the telephone industry the wavefomn sampling frequency fs is usually 

15 fixed at 8000 per second which limits the detection of the constituent frequencies 
in the waveform to those of 4 KHz and below. Since the sampling rate is fixed, 
the remaining transform parameters to be selected for use are the number of 
waveform samples N to put in the "frame" and the succession of k frequencies ©k 
to be used in forming the N dot products with the samples of x(n). Once N and U 

20 have been chosen, the interval between frequency samples, Q = 27i fs / N, is 
determined. 

In the prior art some attention has been paid to the situation where the 
frequencies being looked for are not integer multiples of a common base 
frequency (Ok • US patent 5,644,634 attempts to deal with the problem by using a 
25 number of samples N, i.e., a "frame" length that depends on the frequency being 
looked for. In accordance with the present invention, however, it is not 
necessary to use different length frames. Instead, it has been appreciated that 
while the use of a common frame for all samples of the waveform may lead to 
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"errors", the errors are predictably of the form sinx/x and can be compensated 
for. 

Referring to Fig. 1, the input to the DTMF detector is, illustratively, a 
sequence of 16-bit linearly quantized samples that are referenced to a sinusoidal 

5 signal whose peak to peak amplitude spans the entire dynamic range of the 16 
bit numbers corresponding to a signal power of 3 dBm. The input is fed into a 
framer 101, which collects N=105 samples for further processing. A frame of 105 
samples is fed to DPT 102 which computes the in-phase components 10 to 17; the 
quadrature components QO to Q7 and power PO to P7 at each of the 8 DTMF 

1 0 frequencies. The first few dot product multiplications of the sampled waveform by 
complex sinusoids is illustrated by the following lines of pseudo code. The Input 
Samples are in array lnputSamp!es[105]. The sine and cosines are stored in 
tables DFTCos[8]I105] and DFTSin[8][105]: 
for (Freq = 0; Freq < 8; Freq++) 

15 { 

lnPhase[Freq] = 0; 
Quadrature[Freq] = 0; 
for (1 = 0; i< 105; i++) 

{ 

20 lnPhase[Freq] += lnputSamples[i] * 

DFTCos[Freq]Ii]; 

Quadrature[Freq] += lnputSamples[i] * 
DFTSin[Freq]Ii]; 
} 

25 Power[Freq] = sqr(lnPhase[Freq]) + 

sqr(Quadrature[Freq]); 
} 

DFT 102 is need not be a complete DFT since it is necessary to only 
compute outputs at 8 of the 105 possible frequency bins. The input is also fed to 
30 Power computation block 1 03 which computes the short term power over the 1 05 
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sample window. It should be appreciated that for MF detection only 6 MF 
frequencies are involved rather than 8 for DTMF. 

The DFT outputs (I, Q, and P) are fed into sort function 104 which sorts 
the 8 tones in order of descending power and outputs the lA, IB, OA, QB and PA, 

5 PB values for the two highest power tones and the value of the third highest 
power tone PC. The i and Q values associated with the two highest power tones 
are fed into a frequency offset computation functions 105, 106 which determine 
the frequency deviation from the nominal frequency. The deviations PDA, PDB 
are computed by comparing the current phase with the phase of the previous 

10 frame, forming the phase change (Delta Phase). The expected or nominal phase 
change for the tone in question is subtracted from the computed phase change, 
resulting in the phase change deviation (PD), which is a measure of frequency 
deviation: 

P = atan2(Q, I) 

1 5 Delta Phase = PP - P (where PP = Previous Phase) 

PD = Delta Phase - Nominal Delta Phase. 
atan2 is a function that computes the arc-tangent using both the I and Q 
components in order to compute the angle within the full unit circle (-PI .. PI). 

The frequency deviation (PD) is fed into a power compensation block, 

20 which is used to adjust the computed power for the sin(x)/x characteristic of the 
DFT operation, where x is a measure of deviation from the nominal frequency. 

An example is in order. At our sampling frequency of 8000 Hz and frame 
size of 105 samples, the frequency bins are spaced at approximately 38 Hz. For 
example, the second bin, which is centered at 76 Hz, bin spans 76 +/- 38/2 Hz or 

25 57 Hz through 95 Hz. If the input frequency is 76 Hz, the deviation will be zero. If 
the input frequency is either 57 or 95 Hz, the deviation will be PI. (Recall that we 
took the absolute value to form the deviation.) At frequencies in between, the 
deviation is proportional to the frequency deviation 
Deviation = Frequency Deviation * PI/38 
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That having been said, we use this deviation to compensate the DFT 
output power for the Sin(x)/x rolloff of amplitude that is characteristic of the DFT 
within each of its frequency bins. Using our previous example, the "gain" 
characteristics of the DFT are as follows: 
5 G = sin{Deviation/2)/(Deviation/2) where Deviation is in radians. In order to 
compensate for this rolloff, we must multiply the resulting DFT amplitude by the 
inverse of this gain. The correction factor for the amplitude as a function of 
deviation is therefore: 

Correction = (Deviation/2) / sin(Deviation/2) 
1 0 The power conrection is the square of this number. 

After computing the phase deviation DPhase, the adjustment is illustrated 

by the following lines of pseudo code: 

Correction = ComputeSinXOverXCorrection{DPhase, Frequency); 
Power[Freq] = Correction * Power[Freq]; 

15 where ComputeSinXOverXCon-ection( ) is a function that computes the 
multiplicative correction of x/Sin(x) based upon the frequency deviation, which is 
represented by DPhase. Since the translation between DPhase and the 
frequency deviation is a function of the center frequency in question, the 
Frequency is fed to the ComputeSinXOverXCorrection function. 

20 What we are left with is a number of statistics feeding a Decision function shown 
in Fig. 2. 

The decision process of Fig. 2 comprises a number of validity checks. For 
DTMF detection, the first validity check verifies that one of the selected 
frequencies is one of the four lower DTMF frequencies and that the other 

25 selected frequency is one of the higher DTMF frequencies. (Of course, for MF 
detection there are not two different frequency bands to be verified). The second 
validity check verifies that the frequency deviation of each of the two selected 
frequencies is within tolerance. The third validity check verifies that the twist is 
within tolerance. The fourth validity check verifies that each of the two tones' 

30 powers exceeds the minimum requirement. The final validity check verifies that 
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the signal to noise ratio (SNR) exceeds the minimum requirement. The "Valid" 
(V) flag is the logical AND of all the aforementioned validity checks. An 
additional check is done to verify that the DTMF tone pair that was detected in 
the current frame is the same as that detected in the previous frame. The result 
5 of this check is the Equal (E) flag. The Valid and Equal flags are inputs into the 
state machine, Fig. 3, which uses this information in conjunction with the past 
state to make a final decision. The state machine ensures that the tone is present 
for the minimum duration. It also performs debouncing. The abbreviations used 
in Fig. 3 are as follows: 
1 0 V: Tone Pair Validity Check Passed 

E: Current tone index = previous tone index 

R: DTMF Detector Result Code 

HCNT: Hit Count 

MCNT: Miss Count 
1 5 MINHC: Minimum hit count 

MINMC: Minimum miss count 

HIT: Increment hit counter with MINHC as the maximum value. Set miss 
counter = 0 

MISS: Increment miss counter, with MINMC as the maximum. Set hit 
20 counter = 0. 

The state machine has 5 states, represented by the circles in the diagram. The 
states are defined as follows: 

• IDLE - No DTMF activity has been detected recently 

• VALID - The validity checks have passed 

25 • VALID, EQUAL - The validity checks have passed and the tone detected 
in the cun-ent window is equal to the tone that was detected in the 
previous window 

• ACTIVE - A valid DTMF digit has been detected and it is still active 

• TRAIL - The DTMF digit is trailing off. 
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With respect to the state machine VALID refers to the situation where all 
the validity checks pass in the current frame. When the hit count reaches 
MINHC, the active state is declared which indicates that a tone has been 
detected. (This is shown as R = DTMF_DIGIT). The state machine remains in the 

5 active state until the tone starts to trail off. The following result codes exist: 
DTMF_EARLY indicates that the beginning of a DTMF tone might be present; 
DTMF_DIGIT indicates that a tone has been detected and is still being detected; 
DTMF_TRAIL_EDGE indicates that the tone is trailing off; DTMFJDLE is the idle 
state; DTMF_FALSE_EARLY indicates that false detection of the start of a tone. 

10 The tone index (0..16 for DTMF) corresponding to the Row, Column pair. The 
tone Index is only valid if the two highest amplitude tones are a row and column 
(DTMF only). The Previous Tone Index is the Tone index from the previous 
frame. Thus, if the Tone Index (for DTMF detection) for the current frame differs 
from the Tone Index of the previous frame, valid tone cannot be declared. This is 

15 an extra check to reduce the probability of false detection; i.e., the DTMF tone 
has been valid and unchanged for the minimum pulse duration. A hit is defined 
as an occurrence of a valid DTMF tone in the current frame. The hit counter, 
HCNT is incremented each time a hit occurs, but HCNT will not be incremented 
above MINHC. MINHC is actually the threshold for declaring a valid DTMF digit 

20 detection. The terminology refers to the minimum number of hits required to 
detect a DTMF digit. A miss is defined as the lack of detection of a valid DTMF 
tone In the current frame. The miss counter, MCNT is incremented each time a 
miss occurs, but MCNT will not be incremented above MINMC. MINMC is 
actually the threshold for declaring a DTMF digit has ended. The terminology 

25 refers to the minimum number of misses required to declare that a DTMF digit 
has ended. The state machine is updated once per DFT window or frame. A 
state transition is indicated by an an"ow that starts at the current state and points 
to the next state. The state transition is labeled with text containing the event that 
triggers the transition and the action that takes place as a result. 
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The following pseudo-code describes the program flow of the DTMF 
detector software: 

DTMFDetect 

While there are more samples to process: 

5 Process samples using DFT function. (This function includes 

updating the total power) 
If processing of DFT window is complete 

Perform Post-DFT processing (DumpChannel( ) ) 
DumpChannel (Performs Post-DFT processing) 
10 Compute DFT bin powers 

Sort DFT frequencies in descending order of power, retaining the 3 
highest power 

Compute the tone index based upon 2 highest power frequencies 
Compute phase associated with 2 frequencies 
1 5 Compute phase changes with respect to previous DFT window 

Verify minimum power requirements 
Verify that frequency offeet is within specified requirements 
Correct powers for sin(x) / x based upon frequency deviation 
Subtract the two corrected powers from the total power to form the noise 

20 power 

Verify minimum power requirements based upon corrected powers 
Verify that SNR and twist requirements are met 

Verify that second highest power frequency sufficiently exceeds third 
highest power 

26 The Valid flag is set as the logical and of the aforementioned verifications 

Set the Equal flag if the current tone index is equal to the previous tone 

index 

Implement the DTMF Detector state machine. 
Update previous tone index, clear noise power. 
30 Return result. 
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Fixed Point Radian Scale Is Introduced as a way to express radian frequency 
using fixed point numbers. A fixed point radian number varies between -16384 
and 16384 as the radian varies between -PI and +PI. The conversion is therefore 
Fixed Point Radian = 16384/PI*Radian 

5 The Arc Tangent function is a transcendental function and is hence defined as a 
summation of an infinite set of rational numbers. It Is not practical to perform 
such an operation in real-time on a DSP. There are methods that can be used to 
approximate the arc tangent function with reasonable precision with a finite 
number of multiply, add, and divide operations. Unfortunately, these methods are 

10 best implemented using floating point arithmetic due to dynamic range issues, 
making these methods less than desirable on a fixed point DSP running a real- 
time application. 

The DTMF detector requires little precision from the arc tangent function 
because the specifications for frequency offset are loose enough that some error 
15 in the calculations can be tolerated. Therefore, the arc tangent of the quotient of 
the quadrature and in-phase components can be approximated by the quotient 
itself for small absolute values of the quotient. Thus, the approximation can be 
made that for -PI/4 <= x <= PI/4, arctan(x) = x. Given a set of rectangular 
coordinates I and Q (or adjacent and opposite), the corresponding angle in polar 
20 form is computed as Atan2(Q, I). Our approximation only accounts for -PI/4 <= 
Q/l <= pi/4, which is a mere quarter of the unit circle. As a result, we first take the 
approximate arctangent of abs(Q/l). This gives us an angle Thetal between 0 
and PI/4. 

A! = abs(l) 
25 AQ = abs(Q) 

Thetal = AQ/AI //Approximation 
Next, we make use of three pieces of information to compute the actual 
corresponding angle Theta within the range -PI .. PI: 
Sign of I 



KURTZ 2 - 2 



13 



Sign of Q 

Determining which of I and Q has greater absolute value 
Switch(l, Q, Al, AQ) 

I >= 0, Q >= 0, Al > AQ // 0 <= Theta <= PI/4 
5 Theta = Theta 1 

I >= 0, Q >= 0, Al < AQ // PI/4 <= Theta <= PI/2 

Theta = PI/2 - Theta 1 
l< 0, Q >= 0, Al > AQ // 3/4PI <= Theta <= PI 
Theta = PI - Theta 1 
10 l< 0, Q >= 0, Al< AQ //PI/2 <= Theta <= 3/4P1 

Theta = PI/2 + Theta 1 
I >= 0, Q < 0, Al > AQ // -PI/4 <= Theta <= 0 

Theta = -Theta 1 
I >= 0, Q < =, Al< AQ // -PI/2 <= Theta <= -PI/4 
15 Theta = -PI/2 + Theta 

l< 0, Q < 0, Al > AQ //-PI < Theta <= -3/4PI 

Theta = -PI + Thetal 
l< 0, Q < 0, Al< AQ //-3/4PI <= Theta <= -PI/2 
Theta = -PI/2 - Thetal 
20 What has been described is deemed to be illustrative of the principles of 

the invention. Further and other modifications may be made by those skilled in 
the art without, however, departing from the spirit and scope of the invention. 



